
[DEMO] Compiler Optimization#3169

Open
DBooots wants to merge 56 commits into KSP-KOS:develop from DBooots:compiler_optimizations

Conversation


DBooots commented Mar 14, 2026

Demo only at this time

This PR adds an optimization stage to the compiler. This stage converts the Opcodes into a three address code interim representation, upon which various optimizing passes are performed. The interim representation is then converted back into opcodes and emitted to the program context as normal.
The optimizing passes currently implemented trim opcodes without changing any control flow, except to remove branches that are shown to be inaccessible.

Currently Implemented Passes

Suffix Replacement
Replaces CONSTANT: fields with the constant value, and replaces aliased SHIP: fields with the direct alias. This saves 1 opcode per ship alias replaced, and allows constant folding of the CONSTANT values.
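To illustrate the idea (not the actual implementation, which operates on kRISC Opcode objects in C#), here is a rough Python sketch of the suffix-replacement pass. The tuple-based opcode names (`push`, `getmember`) are made up for the example:

```python
import math

# Hypothetical stand-ins for kOS's CONSTANT: suffixes.
KNOWN_CONSTANTS = {"pi": math.pi, "e": math.e}

def replace_constant_suffixes(ops):
    """Collapse a (push CONSTANT; getmember <name>) pair into a single
    push of the literal value, saving one opcode per access."""
    out, i = [], 0
    while i < len(ops):
        if (i + 1 < len(ops)
                and ops[i] == ("push", "CONSTANT")
                and ops[i + 1][0] == "getmember"
                and ops[i + 1][1] in KNOWN_CONSTANTS):
            out.append(("push", KNOWN_CONSTANTS[ops[i + 1][1]]))
            i += 2  # two opcodes collapse into one
        else:
            out.append(ops[i])
            i += 1
    return out
```

Because the suffix access becomes a literal, the constant folding pass below can then fold it into surrounding arithmetic.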
Constant Propagation
Where a variable is assigned a constant, accesses of that variable are replaced with the constant (taking care around global variables and triggers, which can change variable values at run time). This allows constant folding of the assigned values.
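A minimal sketch of the propagation idea over a three-address-code list (assumed shape: `(dest, op, arg1, arg2)` tuples; the real pass also tracks globals, triggers, and reaching definitions, none of which is modeled here):

```python
def propagate_constants(instrs):
    """Replace variable reads with known constant values.
    A destination stops being 'known' as soon as it is reassigned
    to anything other than a literal constant."""
    known, out = {}, []
    for dest, op, a, b in instrs:
        a = known.get(a, a)  # substitute known constants for reads
        b = known.get(b, b)
        out.append((dest, op, a, b))
        if op == "const" and isinstance(a, (int, float)):
            known[dest] = a
        else:
            known.pop(dest, None)  # dest is no longer a known constant
    return out
```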
Constant Folding
Performs arithmetic, including scalar math functions, to reduce formulas to their simplest form. This includes the CONSTANT: fields from Suffix Replacement and the propagated constant variables from Constant Propagation. This step saves between 1 and 3 opcodes for every folding optimization that is performed.
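As a rough sketch of the folding step (again over hypothetical three-address tuples, not the PR's actual IR), an op whose operands are all literals is evaluated at compile time and rewritten as a constant:

```python
import math

# A small sample of foldable operations; the real pass covers the
# scalar math functions as well.
FOLDERS = {"add": lambda a, b: a + b,
           "mul": lambda a, b: a * b,
           "sqrt": lambda a, _: math.sqrt(a)}

def fold_constants(instrs):
    """Evaluate ops whose operands are all literals, rewriting them
    as 'const' instructions so later uses can fold too."""
    known, out = {}, []
    for dest, op, a, b in instrs:
        a = known.get(a, a)
        b = known.get(b, b)
        if op == "const" and isinstance(a, (int, float)):
            known[dest] = a
            out.append((dest, op, a, b))
        elif op in FOLDERS and isinstance(a, (int, float)) and (
                b is None or isinstance(b, (int, float))):
            val = FOLDERS[op](a, b)      # do the arithmetic at compile time
            known[dest] = val
            out.append((dest, "const", val, None))
        else:
            known.pop(dest, None)
            out.append((dest, op, a, b))
    return out
```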
Dead Code Elimination
Eliminates any branches where the condition is known to be false after propagating and folding constants.
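The elimination itself reduces to a simple rewrite once propagation and folding have made a branch condition constant. A hedged sketch (opcode names hypothetical; the follow-up sweep that drops the now-unreachable blocks is not shown):

```python
def eliminate_dead_branches(instrs):
    """Rewrite 'branch_false <const> <label>' when the outcome is known:
    a constant-true condition means the branch is never taken (drop it);
    a constant-false condition means it is always taken (plain jump)."""
    out = []
    for ins in instrs:
        if ins[0] == "branch_false" and isinstance(ins[1], bool):
            if ins[1]:
                continue                      # never taken: fall through
            out.append(("jump", ins[2]))      # always taken
        else:
            out.append(ins)
    return out
```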
Peephole Optimizations
This pass does a number of small optimizations:

  • Replaces string indexing against a lexicon with suffix accessing, when the string is a valid identifier. This saves 1 opcode for every index set or get.
  • Replaces parameterless suffix calls with a direct suffix access (since the suffix get opcode checks if it's a method and invokes it for free). This saves 2 opcodes for each suffix method call, but feels like cheating.
  • Replaces calls to VectorDotProduct() with a simple multiplication opcode. This saves 2 opcodes for every VectorDotProduct() that is performed.
  • Replaces double negation or double logical not operations with the original value. This probably doesn't come up often, but it saves 2 opcodes every time.
  • Replaces not->branch[true|false] with branch[false|true]. This saves 1 opcode per branch simplified.
  • Performs algebraic simplification (not necessarily of constants) to reduce the number of opcodes, e.g. A*B+A*C = A*(B+C), X*X*X = X^3, or -A+B = B-A. This saves multiple opcodes wherever it applies.
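Two of the peephole rewrites above (double-not removal and not->branch inversion) can be sketched as a single left-to-right pass over hypothetical opcode tuples; this is illustrative only, not the PR's C# code:

```python
def peephole(ops):
    """One forward pass, matching against the last emitted opcode."""
    out = []
    for op in ops:
        # not ∘ not = identity: cancel the pair.
        if op == ("not",) and out and out[-1] == ("not",):
            out.pop()
            continue
        # not->branch[true|false] becomes branch[false|true].
        if op[0] in ("branch_true", "branch_false") and out and out[-1] == ("not",):
            out.pop()
            flipped = "branch_false" if op[0] == "branch_true" else "branch_true"
            out.append((flipped, op[1]))
            continue
        out.append(op)
    return out
```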

Scope Simplification
Any scope that does not define any variables is eliminated. This saves 2 opcodes for each scope that is collapsed.
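A sketch of the scope-collapsing idea, assuming hypothetical `pushscope`/`popscope`/`storelocal` opcode tuples (the real opcodes and scope bookkeeping differ):

```python
def simplify_scopes(ops):
    """Delete pushscope/popscope pairs whose body declares no locals,
    saving 2 opcodes per collapsed scope. Repeats until stable so that
    newly emptied outer scopes collapse too."""
    ops = list(ops)
    changed = True
    while changed:
        changed = False
        for i, op in enumerate(ops):
            if op[0] != "pushscope":
                continue
            depth, j = 1, i + 1          # find the matching popscope
            while j < len(ops) and depth:
                if ops[j][0] == "pushscope":
                    depth += 1
                elif ops[j][0] == "popscope":
                    depth -= 1
                j += 1
            body = ops[i + 1:j - 1]
            if depth == 0 and not any(o[0] == "storelocal" for o in body):
                del ops[j - 1]           # remove popscope first
                del ops[i]               # then the pushscope
                changed = True
                break
    return ops
```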

Next Steps

  • More testing! While I've added some unit tests, which run correctly, and I've reviewed the opcode output from those tests to see that the optimization passes are doing what they are supposed to, unit testing only covers simple cases. I need to compile larger and more complicated programs and see what breaks.
  • Further optimization passes. The OptimizationLevel.cs file includes the roadmap for future passes that could be implemented. So far only the O1 (Minimal) level is finished.
  • Adding optimization level specification to the compile function. Unit testing has direct access to the Compile Options fed to the compiler. We'll need to add support for in-game compilation level selection. I think I've got it hardcoded to use no optimizations in the interpreter (prioritizing responsiveness over performance) and Minimal for general compilation (running or compiling programs), but this is only a development hack.

DBooots added 30 commits March 1, 2026 22:53
…OptimizationLevel enum to select level of optimization; BaseIntegrationTest uses no optimization. Default the interpreter to no optimization since it won't be worth it there.
…IRBuilder generates basic blocks, which IREmitter can then convert back to kRISC opcodes. This initial implementation does not accomplish any optimization yet (the opposite sometimes) but all current tests pass on the outputted opcodes.
…nalities with IRVariable but not be a subclass.
…or logical operation. This will be used for constant folding during compile. Hopefully the C# compiler will inline these back into the original Opcode methods, but the impact to execution should be minimal either way.
…heir most reduced form. Invalid operations found during folding throw a compilation exception, which should report the problematic line and column in the source.

Also add tests to confirm correct functionality (and improve current test suite for equivalent non-optimizing tests).
…d adding a virtual 'interim CPU' to accomplish the function call through the FunctionManager in order to avoid rewriting the function call logic.
…ng the AssemblyWalk attribute to discover all classes implementing IOptimizationPass.
…n be aliased with the alias (saves one opcode per access). This also replaces all constant() or constant: suffix accesses with the constant value, and does so in time for the constant folding pass.
…ally) every emitted opcode will have a location that makes sense.
…entation for whole-program optimizations (e.g. function inlining).
…d function and lock labels. Fix fallthrough jumps happening at the wrong time.
…optimizing of function code fragments. Add inspection methods to UserFunction and UserFunctionCollection to intercept the code fragments during the optimization pipeline.
…re something would be popped from the stack, but a BasicBlock's stack is empty at that point.
…of pass sort indices to give more space for expansion.
…. Note that scopes are not fully implemented yet.
…that 0/0=1 or 0/X=0 for all X (including 0) instead of throwing an exception. But that's already undefined behaviour so this should be acceptable.
…ter passes to do targeted constant folding after making a change.
DBooots added 23 commits March 11, 2026 16:41
…n. This should be used carefully for in-place restructuring of IR instructions.
…truction tree more concise and easier to write.
…ons are equal if their operation and operands are equal. Temporary values are equal if the sequence of operations to return them are equal.
…m. This fixes programs with parameters breaking when they check for an argument marker.
…izations, as well as larger algebraic simplifications. See the list in OptimizationLevel.cs and the complete set of algebraic simplifications in PeepholeOptimizations.cs. Includes a unit test for these operations.
…e confused with class Scope or class VariableScope...
…es scope pops that can be replaced by incrementing the return depth.
…ompared not just by their string name, but also by their scope.
…xceptions when storing a variable defined globally in another scope.
…h constants wherever it can. Variables that are global, or that are written to within a LOCK statement are left alone, and variables that are written to in a function are not replaced after the first call to that function, unless subsequently reset. This works across the reaching definition, although the current implementation is a bit messy.
DBooots changed the title from Compiler Optimization to [DEMO] Compiler Optimization on Mar 14, 2026
@nuggreat

I don't think algebraic simplification can work without strong typing, which kOS doesn't have. While it should work for any scalar operations and most vector operations, it likely breaks down once directions are in the equations. A basic example where it could break down for a vector operation is x*x*x when x is a vector, since x^3 is not a valid operation on a vector. And the example below is where it would give the wrong answer for combined vector and direction operations:

LOCAL a IS v(1,0,0).
LOCAL b IS r(0,90,0).
LOCAL c IS r(0,0,90).

PRINT c * a + b * a.  //should be about v(0,1,-1)
PRINT a * (c + b).    //algebraically expected to be about the same as above but is actually about v(0,1,0)

I have not compiled your branch with the optimizer to check whether these actually do what I think they will, or whether I would need to construct more elaborate cases. If you can successfully do type inference these cases can be caught, but I believe that is not easy to do.


DBooots commented Mar 14, 2026

> I don't think algebraic simplification can work without strong typing, which kOS doesn't have. While it should work for any scalar operations and most vector operations, it likely breaks down once directions are in the equations. A basic example where it could break down for a vector operation is x*x*x when x is a vector, since x^3 is not a valid operation on a vector. And the example below is where it would give the wrong answer for combined vector and direction operations:
>
> LOCAL a IS v(1,0,0).
> LOCAL b IS r(0,90,0).
> LOCAL c IS r(0,0,90).
>
> PRINT c * a + b * a.  //should be about v(0,1,-1)
> PRINT a * (c + b).    //algebraically expected to be about the same as above but is actually about v(0,1,0)
>
> I have not compiled your branch with the optimizer to check whether these actually do what I think they will, or whether I would need to construct more elaborate cases. If you can successfully do type inference these cases can be caught, but I believe that is not easy to do.

Oh shoot, you're absolutely right. The unit test environment doesn't work with non-scalars, so I had gotten complacent and forgotten about the other types where add and mul are valid operators but do different things. I had considered adding a type inference system, which, as you point out, will be needed for algebraic reduction. That will come with formalizing an SSA form and rewriting the reaching-definitions analysis used in the constant propagation pass, which is currently quite messy.
Another next step should also be to extend the kOS encapsulated types to work in the unit test environment so that the type inference system can actually be verified.


DBooots commented Mar 16, 2026

Regarding type inference, I think the easiest option is to leverage the strongly-typed environment in which the code is being compiled. We would extend FunctionAttribute to add a (possibly optional) return type. While I'm at it, I would also add a property to indicate to the compiler that the method does not depend on game state and can be called at compile time if the inputs are immutable. For suffix accesses, if the incoming object type is known, the generic type argument of a given suffix can easily be found through reflection (perhaps cached in Structure.AddSuffix for snappier compiling). Those two easy-to-implement things should cover most of the work.
The harder part will be handling the Calculator classes. I think I would add a second method to Calculator to overload GetCalculator to work with two Types. I would also add abstract methods for the operations, taking two Types as operands. In implementation, these would mirror the structure of their 'real' methods and return the Type of the object it would expect to return.
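To make the Calculator idea concrete, here is a rough Python sketch of what the compile-time half might look like: a dispatch table keyed on operand types that mirrors the runtime operator overloads, so the optimizer can tell scalar `*` from the vector and direction forms. All type names and the table contents are illustrative assumptions, not the PR's design:

```python
# Hypothetical compile-time result-type table for '*', mirroring the
# runtime Calculator dispatch. Entries here are illustrative.
MUL_RESULT = {
    ("Scalar", "Scalar"): "Scalar",
    ("Vector", "Scalar"): "Vector",
    ("Scalar", "Vector"): "Vector",
    ("Vector", "Vector"): "Scalar",      # dot product
    ("Direction", "Vector"): "Vector",   # rotation applied to a vector
}

def infer_mul(left, right):
    """Return the inferred result type, or None when unknown.
    An unknown type means the optimizer must leave the expression alone."""
    return MUL_RESULT.get((left, right))

def mul_is_commutative(left, right):
    """Algebraic rewrites like A*B -> B*A or factoring A*B+A*C are only
    safe when both operands are scalars, per nuggreat's counterexample."""
    return left == right == "Scalar"
```

The key point the sketch captures is nuggreat's: simplifications must be gated on inferred types, and any `None` (unknown) result has to disable the rewrite entirely.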
Also, I take back my criticism about the unit test environment not working for Vectors and Directions. I see now that that's to avoid invoking anything that touches the UnityEngine, which doesn't work when the game isn't running.
